Journal of General Internal Medicine — Latest Matching Preprints

1

Comparing prognostic performance and reasoning between large language models and physicians

Gjertsen, M.; Yoon, W.; Afshar, M.; Temte, B.; Leding, B.; Halliday, S.; Bradley, K.; Kim, J.; Mitchell, J.; Sanders, A. K.; Croxford, E. L.; Caskey, J.; Churpek, M. M.; Mayampurath, A.; Gao, Y.; Miller, T.; Kruser, J. M.

2026-04-25 intensive care and critical care medicine 10.64898/2026.04.17.26350898 medRxiv

Top 0.1%

4.7%

Importance: Physicians routinely prognosticate to guide care delivery and shared decision making, particularly when caring for patients with critical illnesses. Yet, these physician estimates are prone to inaccuracy and uncertainty. Artificial intelligence, including large language models (LLMs), shows promise in supporting or improving this prognostication. However, the performance of contemporary LLMs in prognosticating for the heterogeneous population of critically ill patients remains poorly understood. Objective: To characterize and compare the performance of LLMs and physicians when predicting 6-month mortality for hospitalized adults who survived critical illness. Design: Embedded mixed methods study with elicitation and comparison of prognostic estimates and reasoning from LLMs and practicing physicians. Setting: The publicly available, deidentified Medical Information Mart for Intensive Care (MIMIC)-IV v2.2 dataset. Participants: We randomly selected 100 hospitalizations of adult survivors of critical illness. Four contemporary LLMs (OpenAI GPT-4o, o3- and o4-mini, and DeepSeek-R1) and 7 physicians provided independent prognostic estimates for each case (1,100 total estimates; 400 LLM and 700 physician). Main outcomes and measures: For each case, LLMs and physicians used the hospital discharge summary and demographics to predict 6-month mortality (yes/no) and provide their reasoning (free text). We assessed prognostic performance using accuracy, sensitivity, and specificity, and used inductive, qualitative content analysis to characterize reasoning. Results: Mean physician accuracy for predicting mortality was 70.1% (95% CI 63.7-76.4%), with sensitivity of 59.7% (95% CI 50.6-68.8%) and specificity of 80.6% (95% CI 71.7-88.2%). The top-performing LLM (OpenAI o4-mini) accuracy was 78.0% (95% CI 70.0-86.0%), with sensitivity of 80.0% (95% CI 67.4-90.2%) and specificity of 76.0% (95% CI 63.3-88.0%). 
The difference between mean physician and top-performing LLM accuracy was not statistically significant (p = 0.5). Qualitative analysis revealed similar patterns in LLM and physician expressed reasoning, except that physicians regularly and explicitly reported uncertainty while LLMs did not. Conclusion and Relevance: In this study, LLMs and physicians achieved comparable, moderate performance in predicting 6-month mortality after critical illness, with similar patterns in expressed reasoning. Our findings suggest LLMs could be used to support prognostication in clinical practice but also raise safety concerns due to the lack of LLM uncertainty expression.
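
As a rough illustration of the three performance measures named in this abstract, the sketch below computes accuracy, sensitivity, and specificity from confusion-matrix counts. The function name and the counts are hypothetical: the abstract does not report how many of the 100 cases died, so the 50/50 split used here is only an assumption that happens to reproduce the percentages reported for o4-mini.

```python
def classification_metrics(tp: int, fn: int, tn: int, fp: int) -> dict:
    """Accuracy, sensitivity, and specificity from confusion-matrix counts.

    tp/fn: patients who died, predicted as died / predicted as survived.
    tn/fp: patients who survived, predicted as survived / predicted as died.
    """
    total = tp + fn + tn + fp
    return {
        "accuracy": (tp + tn) / total,
        "sensitivity": tp / (tp + fn),  # share of deaths correctly predicted
        "specificity": tn / (tn + fp),  # share of survivors correctly predicted
    }


# Hypothetical counts (ASSUMPTION, not from the source): 50 deaths and
# 50 survivors among the 100 sampled hospitalizations.
example = classification_metrics(tp=40, fn=10, tn=38, fp=12)
```

Under that assumed split, the three values come out at 78%, 80%, and 76%, matching the figures quoted for the top-performing model. A different split of deaths and survivors would need different counts to match.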

2

The Peripheral Use of Low-dose Vasopressors for Safety and Efficacy (PULSE) in the intensive care unit: a prospective, unblinded feasibility study protocol

Wiseman, J.; Sibley, S.; Perez-Patrigeon, S.; Mekhaeil, M.; Hanley, M.; Hunt, M.; Boyd, T.; Grant, B.; Boyd, J. G.

2026-04-20 intensive care and critical care medicine 10.64898/2026.04.13.26349750 medRxiv

Top 0.2%

4.3%

Introduction: There is increasing interest in the peripheral administration of vasopressors for two main reasons: (1) to expedite vasopressor initiation in patients with refractory shock and (2) to avoid the potential complications associated with central venous catheter placement. The current evidence on the use of peripheral vasopressor administration is primarily based on single-center observational studies. There are inconsistencies in the administration of peripheral vasopressors, including catheter gauge and location, monitoring practices, vasopressor concentrations, and duration of use. This has made it difficult for institutions to develop best practice guidelines. A randomized controlled trial is needed to address this knowledge gap. Methods and analysis: The Peripheral Use of Low-dose Vasopressors for Safety and Efficacy (PULSE) in the intensive care unit is a prospective, unblinded feasibility study. Eligible patients will be 18 years or older, have no existing central venous catheter or peripherally inserted central catheter and have the presence of shock requiring a minimum vasopressor dose of any of the following: norepinephrine 0.0625 mcg/kg/min, phenylephrine 0.625 mcg/kg/min, and epinephrine 0.0625 mcg/kg/min. Fifty patients will be randomized 1:1 into either the peripheral venous catheter or central venous catheter group. The primary outcome is feasibility, defined as (1) a recruitment rate of 4 participants per month, (2) a data capture rate of ≥90%, and (3) a <50% conversion rate from peripheral to central access. The secondary outcomes include the safety of peripheral vasopressor use, alive and central-line-free days, the number of attempts needed to place a catheter, volume status, in-hospital mortality rate, ICU and hospital length of stay, and patient-centred important outcomes. 
Implications: The data collected from this study will inform the design of a definitive randomized controlled trial to assess the safety and efficacy of protocol-driven peripheral vasopressor administration. Ethics and dissemination: This study received approval (6042888) from the Queen's University Health Sciences/Affiliated Teaching Hospitals Research Ethics Boards. Results of this study will be presented at critical care conferences and submitted for publication. Trial registration number: NCT06920173 (https://clinicaltrials.gov/study/NCT06920173).

3

Influenza vaccine effectiveness against influenza-associated hospitalizations and emergency department or urgent care encounters among children and adults - United States, 2024-25 season

DeCuir, J.; Reeves, E. L.; Weber, Z. A.; Yang, D.-H.; Irving, S. A.; Tartof, S. Y.; Klein, N. P.; Grannis, S. J.; Ong, T. C.; Ball, S. W.; DeSilva, M. B.; Dascomb, K.; Naleway, A. L.; Koppolu, P.; Salas, S. B.; Sy, L. S.; Lewin, B.; Contreras, R.; Zerbo, O.; Hansen, J. R.; Block, L.; Jacobson, K. B.; Dixon, B. E.; Rogerson, C.; Duszynski, T.; Fadel, W. F.; Barron, M. A.; Mayer, D.; Chavez, C.; Yates, A.; Kirshner, L.; McEvoy, C. E.; Akinsete, O. O.; Essien, I. J.; Sheffield, T.; Bride, D.; Arndorfer, J.; Van Otterloo, J.; Natarajan, K.; Ray, C. S.; Payne, A. B.; Adams, K.; Flannery, B.; Garg,

2026-04-24 public and global health 10.64898/2026.04.22.26350853 medRxiv

Top 0.2%

3.3%

Background: The 2024-25 influenza season was the most severe in the United States (US) since 2017-18, with co-circulation of both influenza A virus subtypes (H1N1 and H3N2). Influenza vaccine effectiveness (VE) has varied by season, setting, and patient characteristics. Methods: Using electronic healthcare encounter data from eight US states, we evaluated influenza vaccine effectiveness (VE) against influenza-associated hospitalizations and emergency department or urgent care (ED/UC) encounters from October 2024-April 2025 among children aged 6 months-17 years and adults aged 18+ years. Using a test-negative, case-control design, we compared the odds of influenza vaccination between acute respiratory illness (ARI) encounters with a positive (cases) versus negative (controls) test for influenza by molecular assay, adjusting for confounders. Results: Analyses included 108,618 encounters (5,764 hospitalizations and 102,854 ED/UC encounters) among children and 309,483 encounters (76,072 hospitalizations and 233,411 ED/UC encounters) among adults. Among children across care settings, 17.0% (6,097/35,765) of cases versus 29.4% (21,449/72,853) of controls were vaccinated. Among adults, 28.2% (21,832/77,477) of cases versus 44.2% (102,560/232,006) of controls were vaccinated. VE was 51% (95% confidence interval [95% CI]: 41-60%) against influenza-associated hospitalizations and 54% (95% CI: 52-55%) against influenza-associated ED/UC encounters among children. VE was 43% (95% CI: 41-46%) against influenza-associated hospitalizations and 49% (95% CI: 47-50%) against influenza-associated ED/UC encounters among adults. Conclusions: Influenza vaccination provided protection against influenza-associated hospitalizations and ED/UC encounters among children and adults in the US during the severe 2024-25 influenza season. These findings support influenza vaccination as an important tool to reduce influenza-associated disease.
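
In a test-negative design, vaccine effectiveness is conventionally estimated as (1 - odds ratio) x 100, where the odds ratio compares the odds of vaccination among test-positive cases with the odds among test-negative controls. The sketch below applies that formula to the pooled vaccination counts quoted in this abstract. It is a crude, unadjusted calculation with a hypothetical function name; the abstract's own VE figures are adjusted for confounders and reported separately by care setting, so the numbers are not expected to match exactly.

```python
def crude_ve_percent(vacc_cases: int, total_cases: int,
                     vacc_controls: int, total_controls: int) -> float:
    """Crude (unadjusted) vaccine effectiveness from a test-negative design.

    VE = (1 - OR) * 100, with OR = odds of vaccination in cases
    divided by odds of vaccination in controls.
    """
    odds_cases = vacc_cases / (total_cases - vacc_cases)
    odds_controls = vacc_controls / (total_controls - vacc_controls)
    odds_ratio = odds_cases / odds_controls
    return (1.0 - odds_ratio) * 100.0


# Pooled counts across care settings, as quoted in the abstract.
ve_children = crude_ve_percent(6_097, 35_765, 21_449, 72_853)
ve_adults = crude_ve_percent(21_832, 77_477, 102_560, 232_006)
```

Both crude estimates land at roughly 50%, in the same range as the adjusted, setting-specific estimates reported above. The gap between crude and adjusted values is the part attributable to the confounder adjustment and to stratification by setting.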

4

A rights-based intervention integrating social work and ophthalmic care for people experiencing or at risk of homelessness

Hassani, A.; Pecar, K.; Soliman, M.; Bunyon, P.; Ellinger, C.; Tulysewskid, G.; Croft, J.; Carillo, C.; Wewegama, G.; du Plessis-Schneider, S.; Estevez, J. J.

2026-04-24 public and global health 10.64898/2026.04.22.26351525 medRxiv

Top 0.3%

3.1%

Background: Individuals experiencing or at risk of homelessness face substantial barriers to preventive eye care that are poorly addressed by standard service models. Interdisciplinary optometry-social work collaboration offers a rights-based approach to improving engagement and continuity of care. Methods: A convergent mixed-methods study was conducted between February and August 2024 at a multidisciplinary community centre. Clients experiencing or at risk of homelessness received integrated optometry and social work assessment and were prioritised as high, medium, or low based on combined clinical and social risk. Social work follow-up was guided by the Triple Mandate and W-Questions framework. Quantitative data were summarised using mean (SD), median [IQR], or n (%). Qualitative case notes were analysed using content analysis with inductive coding and secondary review for consistency. Results: A total of 165 clients had priority categories coded (high: 68; medium: 47; low: 154). Demographic data were available for 132 clients (60% male; mean age 49.5 years [SD 16]); 27% had not completed high school, 89% reported weekly income below AUD 1000, and 28% had vision impairment. Two hundred forty-five case-note entries were consolidated into 146 unique records. SMS (46%) and phone calls (38%) were the most documented contact methods, although only 21% of calls were answered; missed calls (13%) and disconnected numbers (7%) were common. Multi-modal contact was more frequently documented for higher-priority clients. Appointment assistance was the most recorded facilitator (71%), while rights-based supports, including interpreter and transport assistance, were infrequently documented (≤5%). Qualitative analysis identified unstable communication, reliance on informal supports, and service fragmentation as key influences on recall outcomes. 
Conclusion: This study supports an interdisciplinary, rights-based optometry-social work model to address barriers to preventive eye care among people experiencing or at risk of homelessness. Embedding structured handovers and tiered recall processes within community-based services may strengthen continuity and accountability for high-priority clients. Future implementation should evaluate outcomes related to equity of reach, service integration, and sustained engagement in care.

5

Global burden of stigma and discrimination against transgender and gender-diverse adults: a systematic review and meta-analysis

Barre-Quick, M.; Yeh, P. T.; Kennedy, C. E.; Azuma, H.; McLellan, C.; Cooney, E. E.

2026-04-23 public and global health 10.64898/2026.04.22.26351490 medRxiv

Top 0.4%

2.1%

Importance: Stigma and discrimination against transgender and gender-diverse people are prevalent across many settings and may contribute to substantial health disparities. Objective: To synthesize global evidence on the prevalence of stigma, discrimination, and resilience among transgender (trans) and gender-diverse adults. Data Sources: A systematic search was conducted in PubMed, Embase, CINAHL, Cochrane Central, LILACS, and PsycInfo for articles published between January 1, 2010 and January 2, 2023. This database search was supplemented by grey literature and secondary reference searches. Article Selection: Studies were eligible if they presented primary quantitative data on prevalence of stigma, discrimination, and/or resilience among trans and gender-diverse adults (aged 18 and over), with no restrictions on study design, language, or geographic region. Data Extraction and Synthesis: Two independent reviewers extracted data using standardized forms, with discrepancies resolved by consensus. The JBI Critical Appraisal Checklist for Prevalence Articles was used to assess risk of bias. Random effects meta-analysis was conducted for dichotomous prevalence measures using inverse variance weighting and logit transformation; non-dichotomous prevalence data were summarized descriptively. Main Outcomes and Measures: Outcomes included prevalence estimates for various forms of stigma (anticipated, perceived, internalized, and experienced), discrimination in legal/institutional settings (housing, healthcare, employment, police/prison), and resilience. Results: A total of 97 articles, with data from 72,158 unique trans and gender-diverse participants across 26 countries, met inclusion criteria. Studies showed moderate levels of anticipated stigma, perceived stigma, and internalized stigma. 
Meta-analyses of 36 studies provided pooled estimates of discrimination prevalence across multiple domains: 21.4% in housing (e.g., eviction, rental denial), 24.6% in healthcare (e.g., denial of care, mistreatment), 32.8% in employment (e.g., hiring bias, workplace harassment), and 39.1% in police/prison settings (e.g., profiling, mistreatment). High heterogeneity was observed across studies, reflecting regional and methodological differences. Resilience scores ranged from moderate to high, indicating variation within trans and gender-diverse communities. Conclusions and Relevance: This systematic review and meta-analysis found that stigma and discrimination against trans and gender-diverse adults are pervasive globally. Variation in stigma and discrimination across settings and regions underscores the need for targeted interventions and policy reforms. Funding: World Health Organization through a grant from the Elton John AIDS Foundation and the Bill and Melinda Gates Foundation.

6

The Golden Opportunity or the Cutting Room Floor? Quantifying and Characterizing the Loss and Addition of Social Determinants of Health during Clinician Editing of Ambient AI Documentation

Kim, S.; Guo, Y.; Sutari, S.; Chow, E.; Tam, S.; Perret, D.; Pandita, D.; Zheng, K.

2026-04-22 health systems and quality improvement 10.64898/2026.04.20.26351322 medRxiv

Top 0.4%

1.9%

Social determinants of health (SDoH) are important for clinical care, but it remains unclear how much AI-captured social context is preserved after clinician editing in ambient documentation workflows. We retrospectively analyzed 75,133 paired ambient AI-drafted and clinician-finalized note sections from ambulatory care at a large academic health system. Using a rule-based NLP pipeline, we extracted 21 SDoH categories and quantified retention, deletion, and addition. SDoH appeared in 25.2% of AI drafts versus 17.2% of final notes. At the mention level, AI captured 29,991 SDoH mentions, of which 45.1% were deleted and 54.9% were retained, with clinicians adding 3,583 new mentions. Insurance and marital status were most often deleted, whereas substance use and physical activity were more often retained. Deletion patterns also varied by specialty, supporting the need for specialty-aware ambient AI systems.
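
The mention-level percentages in this abstract can be turned back into approximate counts with simple arithmetic. The sketch below does that; the variable names are illustrative, the counts are rounded back-calculations from the reported percentages rather than figures stated in the source, and treating the final-note total as retained plus clinician-added mentions is an assumption.

```python
AI_MENTIONS = 29_991      # SDoH mentions captured in AI drafts (from the abstract)
DELETED_SHARE = 0.451     # share of AI mentions deleted by clinicians
RETAINED_SHARE = 0.549    # share of AI mentions retained
CLINICIAN_ADDED = 3_583   # new mentions added during editing

# Approximate counts, back-calculated from the reported percentages.
deleted = round(AI_MENTIONS * DELETED_SHARE)
retained = round(AI_MENTIONS * RETAINED_SHARE)

# ASSUMPTION: mentions in final notes = retained AI mentions + clinician additions.
final_mentions = retained + CLINICIAN_ADDED
net_change = final_mentions - AI_MENTIONS
```

On these assumptions, clinician editing removes roughly 13,500 mentions and adds back about 3,600, so final notes carry around 10,000 fewer SDoH mentions than the AI drafts.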

7

Cardiac Rehabilitation and Functional Capacity Improvement: Montana Outcomes Project Cardiac Rehabilitation Registry Findings

Claus, L.; McNamara, M.; Oser, C.; Fogle, C.; Canine, B.

2026-04-21 public and global health 10.64898/2026.04.20.26351126 medRxiv

Top 0.6%

1.4%

Cardiovascular disease (CVD) remains the leading cause of mortality in the United States, despite being largely preventable through effective management of risk factors. This study evaluates the impact of Phase II cardiac rehabilitation (CR) on functional capacity and quality of life, using data from the Montana Outcomes Project Cardiac Rehabilitation Registry. Functional capacity improvements were assessed via the six-minute walk test (6MWT) and Dartmouth COOP questionnaire, with statistical analyses exploring the influence of CR session attendance, demographic factors, and referring diagnoses. Results demonstrated significant gains in 6MWT, with a mean improvement of 330.73 feet (p < .0001), and quality of life scores across all subgroups. A dose-response relationship was observed, indicating greater improvements with increased CR sessions (p < .0001), though diminishing returns were observed beyond 24-35 visits. Demographic factors and complex conditions influenced outcomes, underscoring the need for tailored strategies to enhance CR access and effectiveness. These findings highlight the critical role of CR in improving patient outcomes and emphasize the importance of addressing barriers to participation in underserved populations.

8

Family Constellations for All Clinical Conditions: A Systematic Review and Meta-analysis Showing a Lack of Supporting Evidence

Souza, F. L.; Cabral Souza, N.; Mendes, J. A. d. A.

2026-04-21 psychiatry and clinical psychology 10.64898/2026.04.19.26351231 medRxiv

Top 0.7%

1.2%

Introduction: Family Constellation Therapy (FCT) has been widely disseminated in clinical, public health, and judicial settings despite persistent concerns regarding its theoretical basis, safety, and the limited availability of rigorous randomised evidence supporting its clinical use. Objective: The aim of this systematic review is to assess the effects of FCT across all clinical conditions, explicitly considering both benefits and harms; and summarise the characteristics of studies and intervention settings used in randomised controlled trials of FCT. Methods: Following a prospectively registered protocol (CRD420251136190), we conducted a systematic search of seven databases (PubMed, EMBASE, APA PsycInfo, CENTRAL, BVS, Web of Science, and CINAHL) and grey literature (ICTRP and ProQuest database) without language or date restrictions to identify published and unpublished randomised controlled trials of FCT. Study selection, data extraction, risk of bias (RoB 2), and certainty of evidence (GRADE) were performed in duplicate. Statistical analyses followed a prospectively registered analysis plan with prespecified criteria for data pooling and for handling analytical limitations. Results: No reliable evidence was found to support the use of FCT for any condition across both clinical and non-clinical samples. All trials included were judged to be at high risk of bias and all comparisons were rated as very low-certainty evidence. Concerns regarding potential adverse effects were identified, and the available data were insufficient to establish the effectiveness of the intervention, precluding any clinical recommendation. Conclusion: Clinicians, policymakers, and consumers should reconsider adopting FCT while reliable evidence is not available.

9

Leveraging Predictive AI and LLM-Powered Trial Matching to Improve Clinical Trial Recruitment: A Usability Assessment of Trialshub

Blankson, P.-K.; Hussien, S.; Idris, F.; Trevillion, G.; Aslam, A.; Afani, A.; Dunlap, P.; Chepkorir, J.; Melgarejo, P.; Idris, M.

2026-04-20 health informatics 10.64898/2026.04.17.26351107 medRxiv

Top 0.7%

1.2%

Background: Recruitment remains a major barrier to timely clinical trial completion. Trialshub is an LLM-powered, chat-based platform intended to help users identify relevant trials and connect with coordinators to streamline recruitment workflows. Objective: To evaluate the perceived usability and operational value of Trialshub, and identify implementation considerations for real-world deployment. Methods: A usability test was conducted at Morehouse School of Medicine for the Trialshub application. Purposively selected participants included clinical research coordinators and individuals with and without clinical trial search experience. Participants completed a pre-test survey assessing demographics, digital health information behaviors, and familiarity with AI tools, followed by a moderated usability session using a Trialshub prototype. Users completed scenario-based tasks (locating a breast cancer trial, reviewing results, and initiating coordinator contact) using a think-aloud protocol. Task ratings, screen recordings, and transcribed feedback were analyzed descriptively and thematically. Results: Participants reported high comfort with using digital tools and moderate-to-high familiarity with AI. Trialshub's chat-first design, guided prompts, and checklist-style eligibility display were perceived as intuitive and as reducing cognitive load. Fast access to trials and the coordinator-contact workflow were viewed positively. Key usability issues included uncertainty at step transitions, insufficient cues for selecting results and next actions, and inconsistent system reliability (loading delays, errors, and broken trial detail pages). Participants also noted redundant questioning due to limited conversational memory, and requested improved filtering/sorting and clearer calls-to-action. All participants indicated that Trialshub has strong potential to meaningfully improve clinical trial processes. 
Conclusions: Trialshub shows promise for improving trial discovery and recruitment workflows, with identified design implications for real-world deployment.

10

Inclusive Biology Curriculum Interventions Can Reduce High School Students' Bioessentialist Beliefs

Blake, C. K.; Ewa, O. S.; Eckles, E. B.

2026-04-19 scientific communication and education 10.64898/2026.04.16.719004 medRxiv

Top 0.8%

0.9%

Lesbian, gay, bisexual, transgender, queer, intersex, and asexual (LGBTQIA+) students continue to face violence, exclusion, and barriers at school, including in STEM education. A key underexamined factor in diversity, equity, and inclusion (DEI) efforts is the content of the life science curriculum, which is uniquely positioned to reinforce or refute bioessentialist, binary, and heteronormative biases. Outdated science curricula not only conflict with current scientific evidence but can also perpetuate beliefs that contribute to sexism and LGBTQIA+ marginalization. To address this, we designed four gender and sexual diversity (GSD)-inclusive biology activities, aligned with NGSS standards and informed by inclusive curriculum frameworks. Using a mixed-methods approach, we studied 127 high school students who participated in two or more inclusive biology activities. Surveys conducted before and after implementation showed significant reductions in essential, binary beliefs about sex and gender, and increases in affirming attitudes toward sex and gender diversity. Interviews conducted after implementation further revealed differences between LGBTQIA+ and straight students' conceptualizations of biological sex. Our findings demonstrate that even brief curriculum interventions can shift student attitudes, although we hope future studies will explore the impact of sustained interventions. Updating life science instruction is essential for educational equity and scientific accuracy.

11

A fully remote randomized controlled trial of an ultra-brief digital meditation intervention reduces internalizing symptoms

Glick, C. C.; Pirzada, S. T.; Quah, S. K.; Feldman, S.; Enabulele, I.; Madsen, S.; Billimoria, N.; Feldman, S.; Bhatia, R.; Spiegel, D.; Saggar, M.

2026-04-21 psychiatry and clinical psychology 10.64898/2026.04.19.26351219 medRxiv

Top 0.9%

0.8%

Background: Scalable, low-burden behavioral interventions are needed to address rising subclinical mental health symptoms. However, few randomized controlled trials have evaluated ultra-brief, remotely delivered meditation using multimodal outcome assessment under real-world conditions. Methods: We conducted a fully remote randomized controlled trial (ClinicalTrials.gov: NCT06014281) evaluating a focused-attention meditation intervention delivered via brief instructor training and independent daily practice. A total of 299 meditation-naive adults were randomized to immediate intervention or waitlist control in a delayed-intervention design. Participants practiced ≥10 minutes daily for 8 weeks within a 16-week study. Outcomes included validated self-report measures, web-based cognitive tasks, and wearable-derived physiological metrics. Results: Across randomized and within-participant replication phases, the intervention was associated with significant reductions in anxiety and mind wandering, with effects remaining stable during 8-week follow-up. Improvements were greatest among participants with higher baseline symptom burden. Sleep disturbance improved selectively among individuals with poorer baseline sleep. Secondary outcomes, including rumination, perceived stress, social connectedness, and quality of life, also improved. Cognitive performance showed modest improvements primarily among lower-performing participants. Resting heart rate exhibited nominal reductions. Conclusions: An ultra-brief, fully remote meditation intervention requiring 10 minutes per day was associated with sustained improvements in psychological functioning and smaller, baseline-dependent effects on cognition in a non-clinical population. These findings support digital delivery of low-dose meditation as a scalable preventive mental health strategy.

12

Addition of Bupropion or Varenicline to Nicotine Replacement Therapy After Acute Coronary Syndrome: A Propensity-Matched Real-World Analysis

Qadeer, A.; Gohar, N.; Maniyar, P.; Shafi, N.; Juarez, L. M.; Mortada, I.; Pack, Q. R.; Jneid, H.; Gaalema, D. E.

2026-04-23 cardiovascular medicine 10.64898/2026.04.21.26351432 medRxiv

Top 0.9%

0.8%

Introduction: Smoking cessation after acute coronary syndrome (ACS) is a Class I recommendation, yet prescription pharmacotherapy use remains low and its real-world cardiovascular effectiveness when added to nicotine replacement therapy (NRT) is poorly characterized. Methods: We conducted a retrospective cohort study using the TriNetX US Collaborative Network (67 healthcare organizations). Adults hospitalized with ACS who received NRT within one month, serving as a proxy for active smoking status, were identified. Two co-primary propensity-matched (1:1, 50 covariates, caliper 0.10 SD) comparisons evaluated bupropion + NRT and varenicline + NRT individually versus NRT alone; a supportive analysis evaluated combined pharmacotherapy versus NRT alone. All-cause mortality was the primary endpoint. Secondary outcomes included MACE, heart failure exacerbations, major bleeding, TIA/stroke, emergency rehospitalizations, and cardiac rehabilitation utilization, assessed at 6 months and 1 year via Kaplan-Meier analysis. Hazard ratios (HRs) greater than 1.0 indicate higher hazard in the NRT-only group. Results: After matching, the combined analysis comprised 8,574 pairs, the bupropion analysis 4,654 pairs, and the varenicline analysis 2,126 pairs. At 1 year, the combined pharmacotherapy group had significantly lower all-cause mortality (HR 1.26, 95% CI 1.16-1.37), MACE (HR 1.16, 95% CI 1.12-1.21), heart failure exacerbations (HR 1.16, 95% CI 1.08-1.25), major bleeding (HR 1.18, 95% CI 1.08-1.28), and greater cardiac rehabilitation utilization (HR 0.82, 95% CI 0.74-0.92; all p < 0.001). TIA/stroke did not differ significantly. Six-month results were consistent. Both varenicline and bupropion individually showed lower mortality and MACE. A urinary tract infection falsification endpoint showed no between-group differences, supporting matching validity. The pharmacotherapy group had higher rates of new-onset depression, driven predominantly by bupropion recipients. 
Conclusions: In this propensity-matched real-world analysis, adding prescription smoking cessation pharmacotherapy to NRT after ACS was associated with lower mortality and fewer adverse cardiovascular events, supporting broader integration into post-ACS care pathways.
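
Because this abstract reports hazard ratios with the NRT-only group in the numerator (HR > 1.0 means higher hazard with NRT alone), readers used to the opposite convention may want the reciprocal. The sketch below inverts a hazard ratio and its confidence interval; the function name is hypothetical, and the inversion is a standard re-expression rather than anything the authors report.

```python
def invert_hazard_ratio(hr: float, ci_low: float, ci_high: float) -> tuple:
    """Re-express an HR with the comparison groups swapped.

    Taking reciprocals flips the interval, so the new lower bound
    comes from the old upper bound and vice versa.
    """
    return 1.0 / hr, 1.0 / ci_high, 1.0 / ci_low


# 1-year all-cause mortality from the abstract: HR 1.26 (95% CI 1.16-1.37),
# NRT alone versus combined pharmacotherapy + NRT.
mortality_hr, mortality_low, mortality_high = invert_hazard_ratio(1.26, 1.16, 1.37)
```

Expressed as pharmacotherapy + NRT versus NRT alone, the mortality estimate becomes roughly 0.79 (about 0.73 to 0.86). The same inversion explains why the cardiac rehabilitation result (HR 0.82) denotes greater utilization in the pharmacotherapy group.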

13

Improving Care by FAster risk-STratification through use of high sensitivity point-of-care troponin in patients presenting with possible acute coronary syndrome in the EmeRgency department (ICare-FASTER): a stepped-wedge cluster randomized trial

Than, M.; Pickering, J. W.; Joyce, L. R.; Buchan, V. A.; Florkowski, C. M.; Mills, N. L.; Hamill, L.; Prystowsky, J.; Harger, S.; Reed, M.; Bayless, J.; Feberwee, A.; Attenburrow, T.; Norman, T.; Welfare, O.; Heiden, T.; Kavsak, P.; Jaffe, A. S.; Apple, F.; Peacock, W. F.; Cullen, L.; Aldous, S.; Richards, A. M.; Lacey, C.; Troughton, R.; Frampton, C.; Body, R.; Mueller, C.; Lord, S. J.; George, P. M.; Devlin, G.

2026-04-23 cardiovascular medicine 10.64898/2026.04.21.26351433 medRxiv

Top 0.9%

0.8%

BACKGROUND: Point-of-care (POC) high-sensitivity cardiac troponin (hs-cTn) testing has the potential to expedite decision-making and reduce emergency department (ED) length of stay for patients presenting with possible myocardial infarction (MI) by ensuring that results are consistently available when looked for by clinicians. We assessed the real-life effectiveness and safety of implementing POC hs-cTn testing in the ED. METHODS: We conducted a pragmatic, stepped-wedge cluster randomized trial. The control arm was usual care with an accelerated diagnostic pathway utilizing a single-sample rule-out step with a central laboratory hs-cTn assay. The intervention arm used the same pathway with a POC hs-cTnI. The primary effectiveness outcome was ED length of stay assessed using a generalized linear mixed model, and the safety outcome was 30-day MI or cardiac death. RESULTS: Six sites participated with 59,980 ED presentations (44,747 individuals, 61±19 years, 49.5% female) from February 2023 to January 2025, of which 31,392 occurred during the intervention arm. After adjustment for covariates associated with length of stay, the intervention reduced length of stay by 13% (95% confidence interval [CI], 9 to 16%; P<0.001), corresponding to a reduction of 47 minutes (95% CI, 33 to 61 minutes) from a mean length of stay in the control arm of 376 minutes. The 30-day MI or cardiac death rate was similar in the control and intervention arms (0.39% and 0.39% respectively, P=0.54). CONCLUSIONS: Implementation of whole-blood hs-cTnI testing at the POC into an accelerated diagnostic pathway was safe and reduced length of stay in the ED compared with laboratory testing.

14

Evolving concerns about the COVID-19 pandemic: A content analysis of free-text reports from the UK COVID-19 Public Experiences (COPE) study cohort over a two-year period

Phillips, R.; Wood, F.; Torrens-Burton, A.; Glennan, C.; Sellars, P.; Lowe, S.; Caffoor, A.; Hallingberg, B.; Gillespie, D.; Shepherd, V.; Poortinga, W.; Wahl-Jorgensen, K.; Williams, D.

2026-04-19 public and global health 10.64898/2026.04.16.26351013 medRxiv

Top 1.0%

0.8%

Objectives Concerns about COVID-19 were a key driver of infection-prevention behaviour during the pandemic. The aim of this study was to gain an in-depth longitudinal understanding of the type and frequency of concerns experienced throughout the first two years of the COVID-19 pandemic. Design Content analysis of qualitative descriptions provided in a prospective longitudinal online survey as part of the COVID-19 UK Public Experiences (COPE) Study. Method At baseline (March/April 2020), when the UK entered its first national lockdown, 11,113 adults completed the COPE survey. Follow-up surveys were conducted at 3, 12, 18 and 24 months. Participants were recruited via the HealthWise Wales research registry and social media. Baseline surveys collected demographic and health data, and all waves included an open-ended question about COVID-19 concerns. Content analysis was used to identify the type and frequency of concerns at each time point. Results A total of 41,564 open-text responses were coded into six categories: personal harm (n=16,353), harm to others (n=11,464), social/economic impact (n=6,433), preventing transmission (n=4,843), government/media (n=1,048), and general concerns (n=1,423). The proportion of respondents reporting any concern declined from 75.3% at baseline to 65.8% at 24 months. Over time, concerns about personal harm increased (baseline 41.8% vs. 24-months 52.7%) whereas concerns about harm to others decreased (baseline 48.5% vs. 24-months 28.6%). Concerns about harm were also expressed in relation to clinical vulnerability, lack of trust in government/media, and perceived lack of adherence by others. These were balanced against concerns about wider social and economic impacts of restrictions. Conclusions Public concerns about COVID-19 evolved substantially over the first two years of the pandemic, reflecting changing perceptions of risk and responsibility. 
Monitoring concerns longitudinally is vital to help guide effective communication and behavioural interventions during future pandemics.
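
As a quick consistency check on the coding totals reported in this abstract, the sketch below sums the six category counts and compares the result with the stated 41,564 responses. The variable names are illustrative, and the computed shares are proportions of coded responses, which is an assumption of this sketch and not the same quantity as the per-wave respondent percentages quoted in the abstract.

```python
# Category counts exactly as reported in the abstract.
concern_counts = {
    "personal harm": 16_353,
    "harm to others": 11_464,
    "social/economic impact": 6_433,
    "preventing transmission": 4_843,
    "government/media": 1_048,
    "general concerns": 1_423,
}
reported_total = 41_564  # total open-text responses stated in the abstract

# Sum the categories and check them against the reported total.
total = sum(concern_counts.values())
counts_match = total == reported_total

# Share of all coded responses per category (not per-respondent rates).
shares = {name: round(100 * n / total, 1) for name, n in concern_counts.items()}

if __name__ == "__main__":
    print("Counts match reported total:", counts_match)
    for name, share in shares.items():
        print(f"{name}: {share}% of coded responses")
```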

15

Recovering Clinical Detail in AI-Generated Responses for Low Back Pain Through Prompt Design

Basharat, A.; Hamza, O.; Rana, P.; Odonkor, C. A.; Chow, R.

2026-04-23 pain medicine 10.64898/2026.04.21.26351437 medRxiv

Top 1.0%

0.8%

Introduction Large language models are increasingly being used in healthcare. In interventional pain medicine, clinical reasoning is essential for procedural planning. Prior studies show that simplified prompts reduce clinical detail in AI-generated responses. It remains unclear whether this reflects knowledge loss or simply prompt-driven suppression of information. Methods We performed a controlled comparative study using 15 standardized low back pain questions representing common interventional pain questions. Each question was submitted to ChatGPT under three conditions: a professional-level prompt (DP), a fourth-grade reading-level prompt (D4), and clinician-directed rewriting of the D4 response to a medical level (U4→MD). No follow-up prompting was allowed. Three physicians independently rated responses for accuracy using a 0-2 ordinal scale. Clinical completeness was determined by consensus. Word count and Flesch-Kincaid Grade Level (FKGL) were also measured. Paired t-tests compared conditions. Results Accuracy was highest with professional prompting (1.76). Accuracy declined with the fourth-grade prompt (1.33; p = 0.00086). When simplified responses were rewritten for clinicians, accuracy returned to baseline (1.76; p ≈ 1.00 vs DP). Clinical completeness followed the same pattern: DP 80.0%, D4 6.7%, U4→MD 73.3%. Fourth-grade responses were shorter and less complex. Upscaled responses were more complex and similar in length to professional responses. Inter-rater reliability was low (Fleiss κ = 0.17), but trends were consistent across conditions. Conclusions Reduced clinical detail under simplified prompts appears to reflect constrained output rather than loss of knowledge. Clinician-directed reframing restores omitted content. LLM performance in interventional pain depends strongly on prompt design and intended audience.

16

MIMIC-IV-Phenotype-Atlas (MIPA) : A Publicly Available Dataset for EHR Phenotyping

Yamga, E.; Goudrar, R.; Despres, P.

2026-04-24 health informatics 10.64898/2026.04.16.26350888 medRxiv

Top 1%

0.7%

Introduction Secondary use of electronic health records (EHRs) often requires transforming raw clinical information into research-grade data. A central step in this process is EHR phenotyping - the identification of patient cohorts defined by specific medical conditions. Although numerous approaches exist, from ICD-based heuristics to supervised learning and large language models (LLMs), the field lacks standardized benchmark datasets, limiting reproducibility and hindering fair comparison across methods. Methods We developed the MIMIC-IV Phenotype Atlas (MIPA) dataset, an adaptation of MIMIC-IV that provides expert-annotated discharge summaries across 16 phenotypes of varying prevalence and complexity. Two independent clinicians reviewed and labeled the discharge summaries, resolving disagreements by consensus. In parallel, we implemented a processing pipeline that extracts multimodal EHR features and generates training, validation, and testing datasets for supervised phenotyping. To illustrate MIPA's utility, we benchmarked four phenotyping methods: ICD-based classifiers, keyword-driven Term Frequency-Inverse Document Frequency (TF-IDF) classifiers, supervised machine learning (ML) models, and LLMs on the task. Results The final MIPA corpus consists of 1,388 expert-annotated discharge summaries. Annotation reliability was high (mean document-level kappa = 0.805, mean label-level kappa = 0.771), with 91% of disagreements resolved through consensus review. MIPA provides high-quality phenotype labels paired with structured EHR features and predefined train/validation/test splits for each phenotype. In the benchmarking case study, LLMs achieved the highest F1 scores in 13 of 16 phenotypes, particularly for conditions requiring contextual interpretation of clinical narrative, while supervised ML offered moderate improvements over rule-based baselines.
Conclusion MIPA is the first publicly available benchmark dataset dedicated to EHR phenotyping, combining expert-curated annotations, broad phenotype coverage, and a reproducible processing pipeline. By enabling standardized comparison across ICD-based heuristics, ML models, and LLMs, MIPA provides a durable reference resource to advance methodological development in automated phenotyping.

17

A systematic review and meta-analysis of the efficacy and safety of pharmacological treatments for obesity in adults: 2026 Update

Ciudin Mihai, A.; Baker, J. L.; Belancic, A.; Busetto, L.; Dicker, D.; Fabryova, L.; Fruhbeck, G.; Goossens, G. H.; Gordon, J.; Monami, M.; Sbraccia, P.; Martinez Tellez, B.; Yumuk, V.; McGowan, B.

2026-04-24 endocrinology 10.64898/2026.04.19.26351196 medRxiv

Top 1%

0.7%

This updated systematic review and network meta-analysis evaluated the efficacy and safety of obesity management medications (OMMs) in terms of reducing body weight and obesity-related complications. Medline and Embase were searched up to 21 November 2025 for randomized controlled trials comparing OMMs versus placebo or active comparators in adults. The primary endpoint was percentage total body weight loss (TBWL%) at the end of the study. Secondary endpoints were TBWL% at 1, 2 and 3 years, anthropometric, metabolic, mental health and quality of life outcomes, cardiovascular morbidity and mortality, remission of obesity-related complications, serious adverse events and all-cause mortality. Sixty-six RCTs (66 comparisons) were identified: orlistat (22), semaglutide (18), liraglutide (11), tirzepatide (8), naltrexone/bupropion (5) and phentermine/topiramate (2), enrolling 63,909 patients (34,861 and 29,048 with active compound and placebo, respectively). All OMMs showed significantly greater TBWL% versus placebo; tirzepatide and semaglutide exceeded 10% TBWL and showed the most favourable glycaemic effects. Semaglutide reduced major adverse cardiovascular events and all-cause mortality. In dedicated complication-specific trials, semaglutide and tirzepatide showed benefit on heart failure-related outcomes; tirzepatide was associated with improved obstructive sleep apnoea syndrome and semaglutide with knee osteoarthritis pain remission. Tirzepatide and semaglutide were associated with improvements in metabolic dysfunction-associated steatohepatitis remission, and semaglutide with improvement in liver fibrosis. No OMMs were associated with an increased risk of serious adverse events. These updated results reinforce the need to individualize OMM selection according to weight loss efficacy, complication profile and safety.

18

The Visual Hemofilter: a novel visualization technology that improves task performance among intensive care professionals: A prospective simulation study.

Bider-Lunkiewicz, J.; Gasciauskaite, G.; Rück Perez, B.; Braun, J.; Willms, J.; Szekessy, H.; Nöthiger, C.; Hoffmann, M.; Milovanovic, P.; Keller, E.; Tscholl, D. W.

2026-04-20 intensive care and critical care medicine 10.64898/2026.04.16.26351012 medRxiv

Top 1%

0.5%

Purpose This study evaluates the Visual Hemofilter, a novel decision-support and information transfer tool designed to assist with regional citrate anticoagulation (RCA) in hemofiltration. By representing hemofilter parameters and patient blood constituents as animated icons, the tool aims to improve clinicians' interpretation of blood gas results and RCA reference tables. We hypothesized that the Visual Hemofilter would enhance clinical decision-making by enabling faster and more accurate therapy adjustments, increasing clinicians' confidence in their decisions, and reducing cognitive workload compared to conventional methods. Methods We conducted a prospective, randomized, computer-based simulation study across four intensive care units at the University Hospital Zurich. Twenty-six critical care professionals participated, each managing regional citrate anticoagulation (RCA) scenarios using either the Visual Hemofilter or conventional methods involving blood gas analysis and reference tables. Following each scenario, participants made therapy adjustments and rated their decision confidence and cognitive workload. Results Use of the Visual Hemofilter significantly improved decision accuracy (odds ratio [OR] 3.96; 95% CI 2.03-7.73; p < 0.0001) and reduced decision time by an average of 33 seconds (mean difference -33.3 seconds; 95% CI -39.4 to -27.2; p < 0.0001). Participants also reported greater confidence in their decisions (OR 5.41; 95% CI 2.49-11.77; p < 0.0001) and experienced lower cognitive workload (mean difference -15.05 points on the NASA-TLX scale (National Aeronautics and Space Administration-Task Load Index); 95% CI -18.99 to -11.13; p < 0.0001). Conclusions The Visual Hemofilter enhances clinical decision-making in RCA by increasing accuracy and speed, boosting decision confidence, and reducing cognitive workload. This technology has the potential to reduce errors and better support critical care professionals in managing complex treatment scenarios.

19

Digital Therapeutic for Hwa-byung Based on Acceptance and Commitment Therapy: A Pilot Feasibility Trial

Kwon, C.-Y.; Lee, B.; Kim, M.; Mun, J.-h.; Seo, M.-G.; Yoon, D.

2026-04-22 psychiatry and clinical psychology 10.64898/2026.04.19.26351203 medRxiv

Top 1%

0.5%

Background Hwa-byung (HB) is a Korean culture-bound syndrome characterised by prolonged suppression of anger and somatic complaints. No evidence-based digital therapeutic (DTx) has been developed for HB. We evaluated the feasibility, user experience (UX), and preliminary clinical effect of an acceptance and commitment therapy (ACT)-based DTx application, Hwa-free, for HB. Methods Adults aged 19-80 years diagnosed with HB were enrolled in a four-week app-based intervention with assessments at baseline (Week 0), Week 2, Week 4, and Week 8 follow-up. The primary outcome was UX assessed via a 22-item survey at Week 4. Secondary outcomes included HB-related symptom and personality scales, depression, anxiety, anger expression, psychological flexibility, health-related quality of life, and heart rate variability. Results Of 45 screened, 30 were enrolled and 28 constituted the modified intention-to-treat population. Mean app use was 19.9 ± 7.9 days (71.2% adherence over 28 days). Adverse events were infrequent and unrelated to the intervention. Positive response rates exceeded 80% for video content (items 2-4: 82.8-89.7%), HB self-assessment (86.2%), meditation therapy (86.2%), and in-app guidance (85.7%). Pre-post improvements from baseline to Week 4 were observed in 11 of 18 clinical scales, including HB Symptom Scale (Δ = -9.8, Cohen's d = -0.92), Beck Depression Inventory-II (Δ = -13.3, d = -1.11), and state anger (Δ = -7.8, d = -0.96). The HB screening-positive rate declined from 100% at baseline to 55.6% at Week 8. Conclusions Hwa-free demonstrated adequate feasibility, acceptable UX, and preliminary evidence of clinically meaningful improvement in HB-related symptoms. A future randomised controlled trial is warranted. Trial registration CRIS, KCT0011105

20

Patterns of maternal transport in a state with levels of maternal care and no formal perinatal regions

Li, J.; Steimle, L. N.; Carrel, M.; Byrd, R. A.; Radke, S. M.

2026-04-22 health systems and quality improvement 10.64898/2026.04.20.26351263 medRxiv

Top 1%

0.5%

Purpose To characterize maternal transport patterns in Iowa, a state with levels of maternal care and without formal perinatal regions, and assess whether transport decisions reflect efficient, risk-appropriate coordination. Methods We analyzed 2010-2023 Iowa birth records, which included 2,251 maternal transports between obstetric facilities across 106 unique routes. We characterized transport patterns and applied a community detection algorithm to identify "communities" of obstetric facilities that disproportionately transport among themselves. Findings Suburban and rural counties have elevated transport rates compared to urban counties. 2,189 transports (97%) were from lower- to higher-level facilities. Among these, 2,037 (93%) were to Level III tertiary care centers. 567 transports (25.2%) bypassed a closer facility offering an equivalent or higher level of care than its destination facility. Health system affiliation was associated with bypassing transport, indicating potential organizational rather than purely geographic drivers of transport decisions. Three "communities" of obstetric facilities largely shaped by geographic proximity were identified. Conclusions Although Iowa does not have formal perinatal regions, patterns of maternal transport are mostly in line with three de facto regions. Some potential inefficiencies were identified, such as obstetric facilities transporting to a farther facility when a closer facility offered the same level of care or higher. These findings may help identify opportunities to enhance care coordination among obstetric facilities, optimize maternal transport networks, and improve regionalization of maternal care.
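
The transport proportions in this abstract use two different denominators, which is easy to miss on a first read. The sketch below recomputes them from the counts given in the text; the variable names and the `pct` helper are illustrative only, and the choice of denominator for each figure is an assumption inferred from the wording ("Among these" for the Level III share).

```python
# Counts exactly as reported in the abstract.
total_transports = 2_251   # all maternal transports, 2010-2023
lower_to_higher = 2_189    # transports from lower- to higher-level facilities
to_level_iii = 2_037       # of those, transports to Level III centers
bypassed_closer = 567      # transports that bypassed a closer eligible facility


def pct(part: int, whole: int) -> float:
    """Return part/whole as a percentage rounded to one decimal place."""
    return round(100 * part / whole, 1)


# Denominator: all transports.
share_lower_to_higher = pct(lower_to_higher, total_transports)
# Denominator: lower-to-higher transports only ("Among these").
share_level_iii = pct(to_level_iii, lower_to_higher)
# Denominator: all transports.
share_bypassed = pct(bypassed_closer, total_transports)

if __name__ == "__main__":
    print("Lower-to-higher share of all transports:", share_lower_to_higher)
    print("Level III share of lower-to-higher transports:", share_level_iii)
    print("Bypass share of all transports:", share_bypassed)
```

Under these assumptions the recomputed values agree with the rounded percentages quoted in the abstract.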